Your browser doesn't support javascript.
Show: 20 | 50 | 100
Results 1 - 20 de 21
Filter
1.
Tien Tzu Hsueh Pao/Acta Electronica Sinica ; 51(1):202-212, 2023.
Article in Chinese | Scopus | ID: covidwho-20245323

ABSTRACT

The COVID-19 (corona virus disease 2019) has caused serious impacts worldwide. Many scholars have done a lot of research on the prevention and control of the epidemic. The diagnosis of COVID-19 by cough is non-contact, low-cost, and easy-access, however, such research is still relatively scarce in China. Mel frequency cepstral coefficients (MFCC) feature can only represent the static sound feature, while the first-order differential MFCC feature can also reflect the dynamic feature of sound. In order to better prevent and treat COVID-19, the paper proposes a dynamic-static dual input deep neural network algorithm for diagnosing COVID-19 by cough. Based on Coswara dataset, cough audio is clipped, MFCC and first-order differential MFCC features are extracted, and a dynamic and static feature dual-input neural network model is trained. The model adopts a statistic pooling layer so that different length of MFCC features can be input. The experiment results show the proposed algorithm can significantly improve the recognition accuracy, recall rate, specificity, and F1-score compared with the existing models. © 2023 Chinese Institute of Electronics. All rights reserved.

2.
Comput Biol Med ; 163: 107153, 2023 Jun 08.
Article in English | MEDLINE | ID: covidwho-20233898

ABSTRACT

This study proposes a new deep learning-based method that demonstrates high performance in detecting Covid-19 disease from cough, breath, and voice signals. This impressive method, named CovidCoughNet, consists of a deep feature extraction network (InceptionFireNet) and a prediction network (DeepConvNet). The InceptionFireNet architecture, based on Inception and Fire modules, was designed to extract important feature maps. The DeepConvNet architecture, which is made up of convolutional neural network blocks, was developed to predict the feature vectors obtained from the InceptionFireNet architecture. The COUGHVID dataset containing cough data and the Coswara dataset containing cough, breath, and voice signals were used as the data sets. The pitch-shifting technique was used to data augmentation the signal data, which significantly contributed to improving performance. Additionally, Chroma features (CF), Root mean square energy (RMSE), Spectral centroid (SC), Spectral bandwidth (SB), Spectral rolloff (SR), Zero crossing rate (ZCR), and Mel frequency cepstral coefficients (MFCC) feature extraction techniques were used to extract important features from voice signals. Experimental studies have shown that using the pitch-shifting technique improved performance by around 3% compared to raw signals. When the proposed model was used with the COUGHVID dataset (Healthy, Covid-19, and Symptomatic), a high performance of 99.19% accuracy, 0.99 precision, 0.98 recall, 0.98 F1-Score, 97.77% specificity, and 98.44% AUC was achieved. Similarly, when the voice data in the Coswara dataset was used, higher performance was achieved compared to the cough and breath studies, with 99.63% accuracy, 100% precision, 0.99 recall, 0.99 F1-Score, 99.24% specificity, and 99.24% AUC. Moreover, when compared with current studies in the literature, the proposed model was observed to exhibit highly successful performance. The codes and details of the experimental studies can be accessed from the relevant Github page: (https://github.com/GaffariCelik/CovidCoughNet).

3.
4th International Conference on Computer and Applications, ICCA 2022 ; 2022.
Article in English | Scopus | ID: covidwho-2283686

ABSTRACT

Respiratory infections are a confusing and time-consuming task that caused recently a pandemic that affected the whole world. One of the pandemics was COVID-19 that has exposed the vulnerability of medical services across the world, particularly in underdeveloped nations. There comes a strong demand for developing new computer-assisted diagnosis tools to present cost-effective and rapid screening in locations wherein enormous traditional testing is impossible. Medical imaging becomes critical for diagnosing disease, X-rays and computed tomography (CT) scan are employed in the deep network which will be helpful in diagnosing diseases. This paper proposes a scanning model based on using a Mel Frequency Cepstral Coefficients (MFCC) features extracted from a respiratory virus CT-Scan image and then filtered by applying Gabor filter (GF). The filtered image is passed to Convolutional Neural Network (CNN) for classifying the image for the presence of a respiratory virus such as Covid, Viral Pneumonia or being a healthy normal image. The proposed system achieved a validation accuracy of 100% with an overall accuracy of 99.44%. © 2022 IEEE.

4.
Computer Systems Science and Engineering ; 46(2):2337-2349, 2023.
Article in English | Scopus | ID: covidwho-2283144

ABSTRACT

This research is focused on a highly effective and untapped feature called gammatone frequency cepstral coefficients (GFCC) for the detection of COVID-19 by using the nature-inspired meta-heuristic algorithm of deer hunting optimization and artificial neural network (DHO-ANN). The noisy crowdsourced cough datasets were collected from the public domain. This research work claimed that the GFCC yielded better results in terms of COVID-19 detection as compared to the widely used Mel-frequency cepstral coefficient in noisy crowdsourced speech corpora. The proposed algorithm's performance for detecting COVID-19 disease is rigorously validated using statistical measures, F1 score, confusion matrix, specificity, and sensitivity parameters. Besides, it is found that the proposed algorithm using GFCC performs well in terms of detecting the COVID-19 disease from the noisy crowdsourced cough dataset, COUGHVID. Moreover, the proposed algorithm and undertaken feature parameters have improved the detection of COVID-19 by 5% compared to the existing methods. © 2023 CRL Publishing. All rights reserved.

5.
Cognit Comput ; : 1-16, 2022 Oct 12.
Article in English | MEDLINE | ID: covidwho-2248818

ABSTRACT

COVID-19 (coronavirus disease 2019) is an ongoing global pandemic caused by severe acute respiratory syndrome coronavirus 2. Recently, it has been demonstrated that the voice data of the respiratory system (i.e., speech, sneezing, coughing, and breathing) can be processed via machine learning (ML) algorithms to detect respiratory system diseases, including COVID-19. Consequently, many researchers have applied various ML algorithms to detect COVID-19 by using voice data from the respiratory system. However, most of the recent COVID-19 detection systems have worked on a limited dataset. In other words, the systems utilize cough and breath voices only and ignore the voices of the other respiratory system, such as speech and vowels. In addition, another issue that should be considered in COVID-19 detection systems is the classification accuracy of the algorithm. The particle swarm optimization-extreme learning machine (PSO-ELM) is an ML algorithm that can be considered an accurate and fast algorithm in the process of classification. Therefore, this study proposes a COVID-19 detection system by utilizing the PSO-ELM as a classifier and mel frequency cepstral coefficients (MFCCs) for feature extraction. In this study, respiratory system voice samples were taken from the Corona Hack Respiratory Sound Dataset (CHRSD). The proposed system involves thirteen different scenarios: breath deep, breath shallow, all breath, cough heavy, cough shallow, all cough, count fast, count normal, all count, vowel a, vowel e, vowel o, and all vowels. The experimental results demonstrated that the PSO-ELM was capable of attaining the highest accuracy, reaching 95.83%, 91.67%, 89.13%, 96.43%, 92.86%, 88.89%, 96.15%, 96.43%, 88.46%, 96.15%, 96.15%, 95.83%, and 82.89% for breath deep, breath shallow, all breath, cough heavy, cough shallow, all cough, count fast, count normal, all count, vowel a, vowel e, vowel o, and all vowel scenarios, respectively. The PSO-ELM is an efficient technique for the detection of COVID-19 utilizing voice data from the respiratory system.

6.
Lecture Notes in Networks and Systems ; 428:45263.0, 2023.
Article in English | Scopus | ID: covidwho-2243440

ABSTRACT

Mel frequency cepstral coefficients are one of the most prominent sets of primary features of an audio signal which are used for speech detection and cough analysis. This paper presents a new method that can overcome some of the common problems faced by using MFCCs for cough detection. In the proposed method, the most prominent part of the cough sample (HCP) is extracted and used to obtain the MFCC vectors of that particular window. These HCP MFCC vectors work as a standard comparison index for all cough samples to detect any respiratory disorders. The evaluation of the proposed method is done using 40 samples of COVID-19 patients of which 20 are positive and 20 are negative. The accuracy of the proposed method is compared with that of the standard MFCC method for the same set of samples. The proposed HCP MFCC method produces results that are 7.84% more accurate than the standard method. By bringing a standard set of comparing features that can work for almost all use cases, this method can be used as a quick identifying tool for various respiratory diseases. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

7.
2nd IEEE Mysore Sub Section International Conference, MysuruCon 2022 ; 2022.
Article in English | Scopus | ID: covidwho-2192042

ABSTRACT

Infection with the SARS-CoV-2 virus results in Covid 19, an infectious illness. Most persons who get Coronavirus will only experience mild to moderate symptoms and will get better without any special care. Some people get very sick and need medical attention. The rising mortality toll from COVID-19 underscores the importance of developing methods for early detection of the disease, which might aid in containing the epidemic and facilitating the creation of tailored mitigation strategies. Current research in chaotic dynamics indicates that coughs and other vocal sounds include lung health data that can be used for symptomatic reasons. Mel frequencies Cepstral Coefficients (MFCC) are applied to cough samples, and then the audio data from coughs is fed into a GridsearchCV model using a KNN-based classification method. Our model was developed using 217 samples from training data and 55 from testing data. Cough tests conducted on both males and females are included in the dataset. An evaluation found that the model had an accuracy of 83.3%. © 2022 IEEE.

8.
Comput Biol Med ; 150: 106123, 2022 Oct 03.
Article in English | MEDLINE | ID: covidwho-2177827

ABSTRACT

The recent investigation has started for evaluating the human respiratory sounds, like voice recorded, cough, and breathing from hospital confirmed Covid-19 tools, which differs from healthy person's sound. The cough-based detection of Covid-19 also considered with non-respiratory and respiratory sounds data related with all declared situations. Covid-19 is respiratory disease, which is usually produced by Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2). However, it is more indispensable to detect the positive cases for reducing further spread of virus, and former treatment of affected patients. With constant rise in the COVID-19 cases, there has been a constant rise in the need of efficient and safe ways to detect an infected individual. With the cases multiplying constantly, the current detecting devices like RT-PCR and fast testing kits have become short in supply. An effectual Covid-19 detection model using devised hybrid Honey Badger Optimization-based Deep Neuro Fuzzy Network (HBO-DNFN) is developed in this paper. Here, the audio signal is considered as input for detecting Covid-19. The gaussian filter is applied to input signal for removing the noises and then feature extraction is performed. The substantial features, like spectral roll-off, spectral bandwidth, Mel frequency cepstral coefficients (MFCC), spectral flatness, zero crossing rate, spectral centroid, mean square energy and spectral contract are extracted for further processing. Finally, DNFN is applied for detecting Covid-19 and the deep leaning model is trained by designed hybrid HBO algorithm. Accordingly, the developed Hybrid HBO method is newly designed by incorporating Honey Badger optimization Algorithm (HBA) and Jaya algorithm. The performance of developed Covid-19 detection model is evaluated using three metrics, like testing accuracy, sensitivity and specificity. The developed Hybrid HBO-based DNFN is outpaced than other existing approaches in terms of testing accuracy, sensitivity and specificity of "0.9176, 0.9218 and 0. 9219". All the test results are validated with the k-fold cross validation method in order to make an assessment of the generalizability of these results. When k-fold value is 9, sensitivity of existing techniques and developed JHBO-based DNFN is 0.8982, 0.8816, 0.8938, and 0.9207. The sensitivity of developed approach is improved by means of gaussian filtering model. The specificity of DCNN is 0.9125, BI-AT-GRU is 0.8926, and XGBoost is 0.9014, while developed JHBO-based DNFN is 0.9219 in k-fold value 9.

9.
2022 IEEE Biomedical Circuits and Systems Conference, BioCAS 2022 ; : 228-232, 2022.
Article in English | Scopus | ID: covidwho-2152431

ABSTRACT

Respiratory diseases have seriously impacted human life in the last couple of years;as Covid 19 arrived, many lost their beloved ones. Since respiratory diseases directly attack the patient's lungs, it is becoming risky day by day for human life and doctors because a confined number of resources are available in hospitals to detect these respiratory diseases, and detection of these diseases is a difficult job to the doctors. Therefore early-stage diagnosis can help the doctor in saving human lives. Researchers are continuously trying to help doctors by designing efficient and more accurate tools for detecting different types of respiratory diseases. This paper uses a convolution-based deep learning model to classify these respiratory diseases using patient respiratory sound signals with Mel frequency cepstral coefficients (MFFCs) as a feature vector. In this paper, we have tried to keep our neural network model as simple as possible with less trainable parameters and good classification accuracy. The model performance is measured in terms of sensitivity, specificity, average score, and harmonic score. © 2022 IEEE.

10.
Knowl Based Syst ; 253: 109539, 2022 Oct 11.
Article in English | MEDLINE | ID: covidwho-1966919

ABSTRACT

Alongside the currently used nasal swab testing, the COVID-19 pandemic situation would gain noticeable advantages from low-cost tests that are available at any-time, anywhere, at a large-scale, and with real time answers. A novel approach for COVID-19 assessment is adopted here, discriminating negative subjects versus positive or recovered subjects. The scope is to identify potential discriminating features, highlight mid and short-term effects of COVID on the voice and compare two custom algorithms. A pool of 310 subjects took part in the study; recordings were collected in a low-noise, controlled setting employing three different vocal tasks. Binary classifications followed, using two different custom algorithms. The first was based on the coupling of boosting and bagging, with an AdaBoost classifier using Random Forest learners. A feature selection process was employed for the training, identifying a subset of features acting as clinically relevant biomarkers. The other approach was centered on two custom CNN architectures applied to mel-Spectrograms, with a custom knowledge-based data augmentation. Performances, evaluated on an independent test set, were comparable: Adaboost and CNN differentiated COVID-19 positive from negative with accuracies of 100% and 95% respectively, and recovered from negative individuals with accuracies of 86.1% and 75% respectively. This study highlights the possibility to identify COVID-19 positive subjects, foreseeing a tool for on-site screening, while also considering recovered subjects and the effects of COVID-19 on the voice. The two proposed novel architectures allow for the identification of biomarkers and demonstrate the ongoing relevance of traditional ML versus deep learning in speech analysis.

11.
45th Jubilee International Convention on Information, Communication and Electronic Technology, MIPRO 2022 ; : 912-917, 2022.
Article in English | Scopus | ID: covidwho-1955354

ABSTRACT

Detection of respiratory viruses is a perplexing task which regularly requires saving time by taking a quick look at clinical images of patients ceaselessly. Hence, there's a need to propose and develop a model to predict the respiratory viruses (COVID-19) cases at the earliest possible to control the spread of disease. Deep learning makes it possible to find out that Covid-19 can be detected in an efficient way using its classification tools such as CNN (Convolutional Neural Network). MFCC (Mel Frequency Cepstral Coefficients) is a very common and efficient technique for signal processing. In this research, a MFCC - CNN learning model to hasten the prediction process is proposed that assist the medical professionals. MFCC is used for extracting the image's features concerning existence of COVID-19 or not. Classification is performed by using convolutional neural network. This makes the time-consuming process easier and faster with more accurate results for radiologists and this reduces the spread of virus and save lives. Experimental results shows that using CT image converted to Mel-frequency cepstral coefficient spectrogram images as input to a CNN can achieve a high accuracy results;with classification of validation data scoring an accuracy of 99.08% correct classification of COVID and NON COVID labeled images. Hence, it can be used practically for detection of COVID-19 from CT images. The work here provides a proof of concept that high accuracy can be achieved with a moderate dataset, which can have a significant impact in this area. © 2022 Croatian Society MIPRO.

12.
1st International Conference on Applied Artificial Intelligence and Computing, ICAAIC 2022 ; : 232-237, 2022.
Article in English | Scopus | ID: covidwho-1932084

ABSTRACT

Covid 19 is an infectious disease that is caused by infection due to SARS-CoV-2 virus. The vast majority of people infected with Corona virus will encounter mild to moderate symptoms and recover without any special treatment. In some case, some people become seriously ill and require clinical consideration. Because of the increase in number of death due to COVID-19, an techniques for the early discovery of the illness is very much needed that might assist with restricting its spread just as help in the development of targeted surrounding solutions. Coughs and other vocal sounds contain pulmonary health data that can be utilized for symptomatic purposes, and ongoing examinations in chaotic dynamics have shows a nonlinear phenomenon exists in vocal signs. Cough samples are transformed with Mel frequency Cepstral Coefficients (MFCC) and the cough audio data is fitted into a GridsearchCV model with KNN based classification algorithm. The number of training data for used for training our model is 217 and remaining 55 data were used for testing the model. The dataset contains the cough tests from both male and female. When evaluated the model could get a precision of 83.3%. © 2022 IEEE.

13.
2nd International Conference on Artificial Intelligence and Signal Processing, AISP 2022 ; 2022.
Article in English | Scopus | ID: covidwho-1846053

ABSTRACT

To combat the Covid-19 outbreak, the education system shifted away from the classroom to distinct e-learning on digital platforms, which made effective use of voice-based recognition systems, especially for preliterate children. Children’s speech recognition systems face multiple challenges owing to their immature vocal tracts, and they demand more intelligence due to the fact that children with diverse accents utter words differently. Accent refers to a unique style of pronouncing a language, particularly one associated with a specific nation, place, or socio-economic background. This paper aims to extract reliable acoustic and prosodic speech cues of accent for classification of native and non-native preschool children using harmonic pitch estimation along with Mel Frequency Cepstral Coefficients (MFCCs) to train the k-Nearest Neighbour (k-NN) classifier. The experimental results reveal that the proposed robust model outperforms various feature extractors in accent classification of native and non-native children in terms of accuracy & F-Measure and more discriminate against noisy environments. © 2022 IEEE.

14.
3rd International Conference on Electrical and Electronic Engineering, ICEEE 2021 ; : 13-16, 2021.
Article in English | Scopus | ID: covidwho-1788708

ABSTRACT

The research study presents an architecture of HumanRobot Interaction (HRI) based Artificial Conversational Entity integrated with speaker recognition ability to avail modern healthcare services. Due to the Covid-19 pandemic, the situation has become troublesome for health workers and patients to visit hospitals because of the high risk of virus dissemination. To minimize the mass congestion, our developed architecture would be an appropriate, cost-effective solution that automates the reception system by enabling AI-based HRI and providing fast and advanced healthcare services in the context of Bangladesh. The architecture consists of two significant subsections: Speaker Recognition and Artificial Conversational Entities having Automatic Speech Recognition in Bengali, Interactive Agent, and Text-to-Speech-synthesis. We used MFCC features as the linguistic parameters and the GMM statistical model to adapt each speaker's voice and estimation and maximization algorithm to identify the speaker's identity. The developed speaker recognition module performed significantly with 94.38% average accuracy in noisy environments and 96.27% average accuracy in studio quality environments and achieved a word error rate (WER) of 42.15% from RNN based Deep Speech 2 model for Bangla Automatic Speech Recognition (ASR). Besides, Artificial Conversational Entity performs with an average accuracy of 98.58% in a small-scale real-time environment. © 2021 IEEE.

15.
2021 International Conference on Forensics, Analytics, Big Data, Security, FABS 2021 ; 2021.
Article in English | Scopus | ID: covidwho-1784480

ABSTRACT

Speech is the most effective form of communication because it is not limited to just the linguistic components but carries the speaker's emotions laced within the components like tone of voice and cues like cries and sighs. This paper aims at studying the research done in the past and applying it to the Covid-19 era.The pandemic is of a great magnitude, affecting every aspect of life including emotions. This time period requires research in determining the most dominant emotions in conversations, to serve as a reference for future research and as a contrast to the research done in the past. Previous papers have identified emotions like happiness, anger, fear and sadness using feature extraction algorithms like MFCC (Mel Frequency Cepstral Coefficients and numerous classification algorithms like GMM (Gaussian Mixture Model), SVM (Support Vector Machine), KNN (K-Nearest-neighbor) and HMM (Hidden Markov Model). Some research has pointed towards ASR (Automatic Speech Recognition), N-Grams and vector space modeling. This paper aims at recognizing the most suitable algorithms for determining the pandemic specific emotions in speech. © 2021 IEEE.

16.
3rd International Conference on Computing and Data Science, CONF-CDS 2021 ; 1513 CCIS:78-90, 2021.
Article in English | Scopus | ID: covidwho-1680663

ABSTRACT

Due to the effects of respiratory diseases, a large number of people die every year. Last year, the new coronavirus COVID-19 swept the world even more, causing the huge loss of personnel and economic decline around the world. To fight against this huge epidemic, relying on hospitals only to detect the such large number of people is obviously inefficient. Therefore, this paper proposes to relieve the pressure of medical workers through sound detection, in a framework of combing the Mel Frequency Cepstral Coefficient (MFCC) and Convolutional Neural Networks (CNN). To compare different voice types and disease types, we selected three open data sets, namely, ICBHI, Coswara and Verify. Then we compared CNN with the Multi-layer Perceptron, Random Forest and XGBoost models. Finally, it is concluded that CNN has the highest accuracy rate on the three data sets. © 2021, Springer Nature Singapore Pte Ltd.

17.
2021 IEEE International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems, ICSES 2021 ; 2021.
Article in English | Scopus | ID: covidwho-1672762

ABSTRACT

The major purpose of the study topic is to use data science to anticipate the future effect of COVID-19 using existing data. The goal of this research is to use data science and analytics to generate precise forecasts of the number of substantiations and deaths. LSTM, GRUs, and Prophet are the major models created and tested for the solution. An LSTM model is a type of Recurrent Neural Network that is used to forecast datasets with increasingly changing patterns. Gated recurrent units only has two gateways: reboot and update. The prophet is best suited for forecasting assignments involving observation swith at least a year of history. The various models discussed above were used to the covid-19 data set to forecast the number of positive cases, active cases, and deaths associated with covid-19. We trained the model using data from April and May 2021 to demonstrate a comparison between the observed and expected number of positive events. To assume the future happing of COVID-19 by applying models which are in use, so that we will be able to calculate the impact of the disease's potential spread throughout the human being, preparing our selves to make proper planning and idea to prevent further transmission and equip health systems to manage the disease properly and battle the worldwide pandemic. © 2021 IEEE.

18.
7th International Conference on Electrical, Electronics and Information Engineering, ICEEIE 2021 ; 2021.
Article in English | Scopus | ID: covidwho-1672732

ABSTRACT

The rate at which technology grew in the past years is unbelievably fast and astounding. However, chronic illnesses like respiratory diseases remains a common and widely experienced problem globally. The emergence of infectious respiratory health issues such as the coronavirus (COVID-19) had only made this enigma more harmful, causing an increase in the number of death due to respiratory illnesses. Hence, the development of modern and accurate methods to improve medical diagnosis is one of the simple step's humans can perform to overcome such problems. In this study, the researchers proposed an enhanced model for lung sound classification using Mel Frequency Cepstral Coefficient (MFCC). The design will classify four different lung sounds, with data input taken and classified one at a time. The goal of which is to augment human intelligence and not to replace the existing lung sound classification methods. The pre-recorded lung sounds were characterized, and the researcher proposed four enhanced MFCC models with three varying designs. The data collected from feature extraction and data mining were evaluated by the machine learning algorithms Support Vector Machine (SVM) and K-Nearest Neighbor (KNN). Measures like sensitivity, specificity, and accuracy were tested to determine which model was superior. Results showed that in terms of performance metrics, KNN performed better than SVM in classifying lung sounds. Tested in three designs where the pre-emphasis was removed, and the original 44.1kHz data resampled. Model 3 using KNN sampled at a frequency of 12000Hz has reached an average accuracy of 96.92% and a blind-data accuracy of 93.33%. A specificity of 97.94% and a sensitivity of 93.83%, achieving a performance that is comparable with existing studies on lung sound classification. © 2021 IEEE.

19.
14th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics, CISP-BMEI 2021 ; 2021.
Article in English | Scopus | ID: covidwho-1672582

ABSTRACT

Cough is a common symptom of respiratory and lung diseases. Cough detection is important to prevent, assess and control epidemic, such as COVID-19. This paper proposes a model to detect cough events from cough audio signals. The models are trained by the dataset combined ESC-50 dataset with self-recorded cough recordings. The test dataset contains inpatient cough recordings collected from inpatients of the respiratory disease department in Ruijin Hospital. We totally build 15 cough detection models based on different feature numbers selected by Random Frog, Uninformative Variable Elimination (UVE), and Variable influence on projection (VIP) algorithms respectively. The optimal model is based on 20 features selected from Mel Frequency Cepstral Coefficients (MFCC) features by UVE algorithm and classified with Support Vector Machine (SVM) linear two-class classifier. The best cough detection model realizes the accuracy, recall, precision and F1-score with 94.9%, 97.1%, 93.1% and 0.95 respectively. Its excellent performance with fewer dimensionality of the feature vector shows the potential of being applied to mobile devices, such as smartphones, thus making cough detection remote and non-contact. © 2021 IEEE.

20.
International Journal of Advanced Computer Science and Applications ; 12(12):133-142, 2021.
Article in English | Web of Science | ID: covidwho-1619204

ABSTRACT

Covid-19 is declared a global pandemic by WHO due to its high infectivity rate. Medical attention is required to test and diagnose those with Covid-19 like symptoms. They are required to take an RT-PCR test which takes about 10-15 hours to obtain the result, and in some cases, it goes up to 3 days when the demand is too high. Majority of victims go unnoticed because they are not willing to get tested. The commonly used RT-PCR technique requires human contact to obtain the swab samples to be tested. Also, there is a shortage of testing kits in some areas and there is a need for self-diagnostic testing. This solution is a preliminary analysis. The basic idea is to use sound data, in this case, cough sounds, breathing sounds and speech sounds to isolate its characteristics and deduce if it belongs to a person who is infected or not, based on the trained model analysis. An Ensemble of Convolution neural networks have been used to classify the samples based on cough, breathing and speech samples, the model also considers symptoms exhibited by the person such as fever, cold, muscle pain etc. These Audio samples have been pre-processed and converted into Mel spectrograms and MFCC (Mel Cepstral Coefficients) are obtained that are fed as input to the model. The model gave an accuracy of 88.75% with a recall of 71.42 and Area Under Curve of 80.62%.

SELECTION OF CITATIONS
SEARCH DETAIL